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DISTANCE FUNCTIONS, CRITICAL POINTS, AND 
TOPOLOGY FOR SOME RANDOM COMPLEXES 

By OmER BOBROWSKllJ AND ROBERT J. AdLEeP 

Technion 

O ■ For a finite set of points P in R'*, the function dv ■ R"^ ->■ R+ 

Cn ' measures Euclidean distance to the set V. We study the number of 

critical points of d-p when V is random. In particular, we study the 
limit behavior of Nk - the number of critical points of dv with Morse 
index A; - as the number of points in V goes to infinity. We present 
nJ . explicit computations for the normalized, limiting, expectations and 

Cn ' variances of the Nk, as well as distributional limit theorems. We link 

these results to recent results in [lOl . Illl ] in which the Betti numbers 
m^ • of the random Cech complex based on V were studied. 

'^ I 1. Introduction. For a finite set V of points in M , of size \V\, let 

2 ' dp : W^ — )■ R"*" be the distance function for V, so that 



o 






(1.1) d-p (x) := in.m\\x — p\\2, xG 

peP 



r>d 



The main results of this paper provide considerable information about 



> 

[^ [ the asymptotic (in \V\) behavior of the critical points (defined below) of dp 

t^ ■ when V is random. While the critical points are, by themselves, intrinsically 

interesting, knowledge of their behavior also has immediate implications 
(via Morse theory) to the study of the topology of Cech complexes built 
over random point sets. 

To be a little more specific, our random sets will either be made up of n = 
1 7^1 independent samples from a common probability density / on R , or from 
a non-homogeneous Poisson process with intensity A^ = nf. Furthermore, 
}2j ■ most of what we shall have to say will concentrate on the distance function 

5t , in neighborhoods of radius r„ around V, when n — )• oo and r^ — )• 0. Denoting 

the number of critical points with index k appearing within distance r„ from 
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V by Nk.n in the random sampling case, and by Nk^n in the Poisson case, 
our main results give expressions for the normalized, asymptotic, means and 
variances of these variables, along with various distributional limit results. 
The limit distributions are of different kinds, and, depending on delicate 
relationships between d, k, Vn and n, provide limits that may be Gaussian, 
Poisson, or deterministic, while also exhibiting a range of critical phenomena. 
Our main results on critical points are described in detail in Section [3l 
However, before stating the results, we first need to describe precisely how to 
define the critical points, along with their indices, for the distance function. 
The difficulty lies in the fact that the distance function is not everywhere 
differentiable. We shall do this in the following section. 

In Section |4] we shall discuss the relationship between the A^^^^^ and N^^n 
and the Betti numbers of a special simplicial complex, the Cech complex, 
based on V. The homology of the Cech complex is closely related to the 
neighborhood set, or r„-tube around V, 

(1-2) Bn := U BrM, 

pev 

where Br„ (p) is the d-ball of radius e around p. What we shall see in Section 
I3]is that, if r„ is too small, then the individual balls in (jl.2p will generally 
fail to intersect, and the topology will be approximately that of a large 
number of disjoint, small balls. This is known as the dust phase. If r„ decays 
too slowly, then the balls will connect and the topology of Bn will be that 
of a single ball. At the (phase) transition Bn will have a percolative-like 
structure, and so we call this the percolation phase. Each of these phases 
exhibits different limit behavior, with even more subtle differences possible 
within phases depending on interactions between parameters. 

Translating our results about critical points into statements about the 
(algebraic) topological structure of S„, as n — )• oo, will also allow us to 
compare them to other results currently in the literature (primarily |lOl.llll]). 



The one comment that we already make at this stage, however, is that we can 
provide a much richer set of results for the asymptotic behavior of numbers 
of critical points than is currently available for the Betti numbers of these 
Cech complexes. Indeed, we can also provide some topological results via 
critical points that are not yet available with a direct topological approach. 
The remainder of the paper contains the proofs of the results in Sections 
[3] and HI These are organized in a number of sections and appendices so 
as to make them as user friendly as possible. Many of the proofs rely on 



techniques in the theory of random geometric graphs as developed in 17 1 



Finally, a few words on motivation. There is considerable current inter- 
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est in the study, from a topological, homological, point of view of random 
structures such as graphs and simplicial complexes. Some recent references 
are [2, \4, IS, la, llj, UM with two reviews, from different aspects, in [l|] and 
[Sj]. Many of these papers find their raison d'etre in essentially statistical 
problems, in which data generates these structures. An important example 



appears in the papers 15|, ll6|] which show that the homology of an unknown 
manifold can be recovered, with high probability, by looking at the homology 
of the union of balls around the points of random samples (or equivalently, 
at the homology of the Cech complex generated by the sampling points on 
the manifold) with or without additional noise. The homological theme of 
these papers, which considers manifolds as being 'close' if their homologies 
are the same, seems particularly promising for situations in which the mani- 
fold of interest is embedded in a space of much higher dimension that itself; 
i.e. in dimension reduction problems and in manifold learning. 

The approach adopted in this paper shares the motivation of the others 
listed above, but, as already noted, by adopting a Morse theoretic point 
of view based on critical points of the distance function, obtains a more 
internally complete theory. 

Acknowledgement. We thank Shmuel Wienberger for introducing us to this 
problem, as well Matthew Strom Borman and Yuliy Baryshnikov, along with 
Shmuel, for many useful discussion in the earlier stages of our work. 

2. Critical Points of the Distance Function. Critical points of 
smooth functions have been studied since the earliest days of calculus, but 
took on signficant additional importance following the development of Morse 
theory (e.g. IJjll^]) which tied them closely to the homologies of manifolds. 



a topic that we shall discuss briefly in Section HI At this point we note that if 
A^ is a nice (closed, differentiable) n-dimensional manifold, and / : A^ — )• M 
a nice (Morse) function, then a point c is called a critical point if V/(c) = 0. 
A non-degenerate critical point is one for which the Hessian matrix Hf{c) is 
non-singular. The Morse index k £ {0, 1, . . . , n} of a non-degenerate critical 
point c is then the number of negative eigenvalues of Hf{c). These points, 
along with their indices, provide one of the main links between differential 
and algebraic topology. 

Classical Morse theory does not directly apply to the distance function 
mainly because it is not everywhere differentiable. However, one can still 
define a notion of non-degenerate critical points for the distance function, 
as well as their Morse index, which we now do. Our arguments follow [7|, 
which we specialize to the case of the distance function. 

Given a set V of points in M , and defining the distance function d-p (II. ip . 
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we start with the local (and global) minima of d-p; viz. the points of V (where 
d-p = 0), and call these critical points with index 0. For higher indices, we 
have the following definition. 

Definition 2.1. A point c € M'' is a critical point of d-p with index 
1 < k < d ii there exists a subset 3^ of A; + 1 points in V such that: 

1- yy ^y '■ dp{c) = \\c — y\\, and, Vp G V\y we have ||c — p\\ > dp{p). 

2. The points in y are in general position (i.e. the k + 1 points of y do 
not lie in a (A: — l)-dimensional affine space). 

3. c E conv°(3'), where conv°(3') is the interior of the convex hull of y 
(an open fc-simplex in this case). 

The first condition implies that dp = dy in a small neighborhood of c. 
The second condition implies that the points in y lie on a unique {k — 1)- 
dimensional sphere. We shall use the following notation: 

(2.1) S{y) = The unique {k — l)-dimensional sphere containing y, 

(2.2) C{y) = The center of S{y) in R'^, 

(2.3) R{y) = The radius of S{y), 

(2.4) B{y) = The open bah in M"' with radius R{y) centered at C{y), 

Note that S{y) is a (k — l)-dimensional sphere, whereas B(y) is a d- 
dimensional ball. Obviously, S{y) C B{y), but unless k = d, S is not 
the boundary of B. Since the critical point c in Definition 12. II is equidistant 
from all the points in 3^, we have that c = C{y). Thus, we say that c is the 
unique index k critical point generated by the k + 1 points in 3^. The last 
statement can be rephrased as follows: 

Lemma 2.2. A subset y C V ofk+1 points in general position generates 
an index k critical point if, and only if, the following two conditions hold: 

CPl C{y) e conv°(3;), 

CP2 vnB{y) = fl>. 

Furthermore, the critical point is C{y) and the critical value is R{y). 

Figure [U depicts the generation of an index 2 critical point in M^ by subsets 
of 3 points. We shall also be interested in critical points c that are within 
distance e from V, i.e. dp{c) < e. This adds a third condition, 

CP3 R{y) < e. 

The following indicator functions, related to CP1-CP3, will appear often. 
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Fig 1: Generating a critical point of index 2 in M? (i.e. a maximum point). 
The small blue disks are the points of V. We examine three subsets of V: 
yi = {yi,y2,y3}, 3^2 = {yi,y5,ye}, and 3^3 = {2/7,2/8,^9}- S{yi) are the 
dashed circles, whose centers are C(3^j) = Cj. The shaded balls are B{yi), 
and the interior of the triangles are conv°(3^j). (1) We see that both C(3^i) G 
conv°(3^i) (ICPll) and p n B(3^i) = (fcfejl . Hence ci is a critical point of 
index 2. (2) (7(3^2) ^ conv°(3^2), which means that (]CPip does not hold, and 
therefore C2 is not a critical point (as can be observed from the flow arrows). 
(3) C(3^3) G conv°(3^3), so (fCPTl) holds. However, we have VnB{y3) = {p}, 
so ()CP2p does not hold, and therefore C3 is also not a critical point. Note 
that in a small neighborhood of C3 we have d-p = c?{p}, completely ignoring 
the existence of 3^3. 



Definition 2.3. Using the notation above, 
(2.5) h{y) 



(2.6) K{y) 

(2.7) g,{y,r) 



l{C(3^)Gconv°(3^)} 

h{y)i^o,e]{R{y)) 
h,{y)i{vnB{y) = (li} 



(ICPlll 

(inpT] 4 mp3i) 
(icpT] + rcp2] + rcP3]) 



3. Main Results. Let Xn = {Xi, . . . ,X„} a random sample of points 
in M from a distribution with density /, which we assume to be bounded. 
This assumption will remain in force throughout the paper, without further 
comment. Let Vn be a spatial Poisson process on R with intensity function 
^n = nf. Denote by C{n,k) and C{n,k), respectively, the sets of critical 
points with index k of dx„ and d-p^. Let {r„}^^ be a sequence of positive 
numbers with lim„^oo ^n = 0, and define 

Nk,n ■■= #{c e C{n, k) : dx^c) < r„}, Nk,n ■= #{c G C{n, k) : dpSc) < rn}- 

Our main goal is to study the limits of N^^n and N^^n as n — )• 00. Since 
A'^o = EjA'^o} = n (the minima are the points of Xn or Vn) we shall only be 
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interested in 1 < A; < d. The results split into three main regimes, depending 
on the rate of convergence of r„ to zero, specifically, on the limit of the term 
nr'il^. We shall state all the results in terms of N^^n- Unless otherwise stated, 
exactly the same results apply for Nj^^n- 

A word on notation: In the formulae presented below, for g : (]R'^)*''+^ — )• M 
and y = (yi,... ,yk) G {^"^f we write 5(0, y) for g{0,yi, . . . ,yk). 

3.1. The Subcritical Range (nr!^ — t- 0). This range is also known as the 
'dust phase', for reasons that will become clearer later, when we discuss 
Cech complexes. We start with the limiting mean. 

Theorem 3.1 (Limit mean). If nr'^ — ;> 0, then for I < k < d, 

'K{Nk,n} = f^k, 



^i(0,y)dy, 



In general, as is common for results of this nature, we cannot explicitly 
compute the ^k- However, when A; = 1, y contains only a single points, and 
so h=l and i?(0,y) = ||y||/2. Therefore, /ii(0,y) = l{||y|| < 2}, yielding 
/ii = 2'^^^ujci J^d f'^{x) dx, where w^ is the volume of the unit ball in M'^. 
Some numerics for other cases are given below. 

The observation that, for a specific choice of r„, there is at most one 
a G [l,(i] such that liTan^^n°'~^^r'^°' G (0,oo) leads to the important fact 
that there is a 'critical' index, kc ■= \_a\ , such that 

(3.3) k <kc^ lim E {N^ n} = 00, k > kc ^ lim E {N^ „} = 0, 

n— >oo ' n-+oo ' 

with any value in (0, 00] possible at k = kc. That is, there is phase transition 
occurring within the subcritical regime itself. Similar regimes, with identical 
limits, appear for asymptotic variances. 

Theorem 3.2 (Limit variance). If nr'^ —?■ 0, then for 1 < k < d, 
lim (n'=+Vf)-iVar(iV ) = ;,,. 

n— >oo 

Not surprisingly, the three regimes also yield different limit distributions. 



(3.1) 






lim (n'^+Vf )-i 

n^oo 


where 








(3.2) 




fJ-k = 


1 / f'^+V' 


(A; + 1)!A/ ^^ 


and hi 


is 


defined 


in (]2.6|). 
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Theorem 3.3 (Limit distribution). Let nr^ — ^ 0, and 1 < k < d, 

1. If linin^oo n'^^^r'^^ = 0, then 

Nk,n ^ 0. 

2. // linin^oo n''+ V^'' = a £ (0, oo), then 

Nk,n -> Poisson (a^fc) . 



3. If Wm.n-^oon^'^^ r'^ = oo, then 

Nk,n-^{Nk,n} C 



AA(0,A/fc). 



As above, for a specific choice of r„, there is going to be at most a single kc 
for which the Poisson limit applies. Otherwise Nk^n converges either to zero 
or infinity. Thus, in the subcritical regime, the picture is that n = No^n ^ 
-^1,™ ^ • • • ^ ^kc,ni while, for k > k^ the value of A^^^^^ will be zero, with 
high probability, which increases with k. 

3.2. The Critical and Supercritical Ranges (nr"^ — )■ A G (0, oo]J. We now 
look at the critical {nr^ — )■ A G (0, oo)) and supercritical {nr'^ — )■ oo) regimes. 
While there are differences between the two regimes, the general outline 
of the results is the same. In both, the correct scaling for Nk^n is n (as 
opposed to n^'^^r'^'' in the subcritical range). Consequently, the limit results 
are similar for all the indices. 

The supercritical regime is significantly more difficult to analyze than ei- 
ther the critical or subcritical, and we shall require an additional assumption 
for this case, which necessitates a definition. 

Definition 3.4. Let / : M'^ — ^ M be a probability density function. 
We say that / is lower bounded if it has compact support and /min := 
inf {/(x) : X G supp(/)} > 0. 

Henceforth, when dealing with the supercritical phase, we always assume 
that / is a lower bounded probability density, and that supp(/) is convex. It 
is not clear at this point if these are necessary conditions, or a consequence 
of our proofs. 

Theorem 3.5 (Limit mean). If nr^ — > A G (0, oo], then, for 1 < k < d, 

lim n-^E{Nk,n} = lk{X), 
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where 

[k + Ij! Jmd^k+l 



7fc(oo) = lim 7fc(A) = —^ [ /.(0,y)e-^^'(0'y) dy, 
A^oo [k + Ij! 



(jjd is the volume of the unit ball in W^, and R, h, and hi are defined in 
(lOD . (I23]l . and ^M, respectively. 



Again, these terms can be evaluated for fc = 1, in which case 

(3.4) 7i(A) = U [ f{x)e-'-^'-'\\y\\'f(^Uydx, 

^ JW- J||j/||<2 

71 (oo) = \\ e-^^''\\y\\' dy = 2^-\ 

For a uniform distribution on a compact set D C M it is easy to show 
that 7i(A) is given by 

(3.5) 7i(A) = 2^^-^! - e^^^-^/VoK^)), 

from which it is easy to check that 71(A) — )• 71(00) as A — )• 00. For higher 
indices, we have no analytic way to compute 7fc(A). However, it can be 
evaluated numerically, and an example is given in Figure [2] for the uni- 
form distribution on [0,1]^. Note that, in that example, 70(00) — 71(00) + 
72(00) — 73(00) ~ 0. This is not a coincidence, and the explanation for this 
phenomenon will be given in Section 14.31 where we discuss the mean Euler 
characteristic of Cech complexes. 

Recall that, in the subcritical phase, the limit mean and the limit variance 
were exactly the same. For other phases, this is no longer true. 

Theorem 3.6 (Limit variance), //nr^ — ^ A G (0, 00] and 1 < k < d, 
lim n~^Var {Nk^n) = (^IW, lim n^^Var (Nk,n) = ^IW, 

where o-^(A) <al{X) < 00. 

The expressions defining o"^(A) and 5^ (A) are rather complicated, and can 
be found at (jS.lOp and ()8.3p . respectively. Note that this theorem, and the 
following central limit theorem (CLT), are the only places where the limit 
values differ between the random sample and Poisson cases. 
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-k=0 

-k=1 

-k=2 

k=3 



Fig 2: The 7fe(A) function. In this example d = 3, and /(x) is the uniform 
density on [0, 1]'^. For k = we know that n~^NQ^n = Ij and for A; = 1 we 
have an explicit formula in (j3.5p . For A; = 2,3 we had to use a numerical 
approximation, hence the noisiness of the graphs. 



Theorem 3.7 (CLT). If nr^ ^ A G (0,oo], then for I < k < d, 



(3.6) ^ ^A/(0,CTfc(A)), — 



n 



n 



MiO,aUX)). 



Note that as an immediate corollary of these CLTs and Theorem 13.61 we 
have the 'law of large numbers' that, under the conditions of the CLTs, 

n~^Nk,n — >7fe(A). 

To conclude this section, we note an interesting result which is unique to 
the supercritical regime, for which we define -/V^„ := \C{n,k)\, the 'global' 
number of critical points of the distance function dx„ in M*^ (i.e. without 
requiring (|CP3p ). We note first that N^^n and Njf^ have identical asymptotic 
behaviors, at least at the level of their first two moments and CLT: 

Theorem 3.8. Ifnr'^ — )■ oo, and f is lower bounded with convex support, 
then, for 1 < k < d, 

Jiim n-^E {ivg} = 7fc(oo), Jim n-^Var (ivg) = al{oo), 
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and 



4i-^ 






As usual, the results are the same for the Poisson case. An obvious corol- 
lary of Theorem 13.81 is that n~^E \ Nff^ — N^^n \ ^ 0- However, much more 
is true: 

Proposition 3.9. Under the conditions of Theorem \3.8l and if nr'^ > 
D*logn, for sufficiently large (f -dependent) D* , then, for 1 < k < d, 



(3.7) lim E 



n— >oo 



Nii - Nk,n 



0. 



Thus, in the supercritical phase, the slow decrease of the radii r„ implies 
that the global and the local number of critical points are ultimately equal 
with high probability, despite the fact that both grow to infinity with in- 
creasing n. This is an interesting and unexpected result, and will turn out to 
be important when we discuss the Euler characteristic of the Cech complex 
in the next section. However, (j3.7p relies heavily on the assumed convexity 
of supp(/). For example, take / to be the uniform density on the annulus 
A = |x G M^ : 1 < |x| < 2|. Then, for n large enough, we would expect to 
have a maximum point (index 2) close to the origin. This critical point will 
be accounted for in A'^^^, but will be ignored by N2^n, since its distance to 

Xn is greater than 1. Thus, we would expect that E{|A'2„ — A'2,n|} -^ 1, 
which contradicts (13. 71) 



4. Random Cech Complexes. As mentioned already a number of 
times, the results of the previous section regarding critical points of the 
distance function have implications for the homology and Betti numbers of 
certain random Cech complexes, and so are related to recent results of [lO| 



and [Ul] . Our plan in this section is to describe these complexes and then the 
connections. We shall assume that the reader either has a basic grounding in 
algebraic topology at the level of [9| or is prepared to accept a definition of 
the k-th. Betti number /?/c = /3fc(X) of a topological space X as the number of 
fc-dimensional 'holes' in X, where a A;-dimensional hole can be thought of as 
anything that can be continuously transformed a fc-dimensional sphere. The 
first Betti number, (3q{X), is merely the number of connected components 
inX. 
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4.1. Cech Complexes and the Distance Function. The Cech complex gen- 
erated by a set of points "P is a simplicial complex, made up of vertices, edges, 
triangles and higher dimensional faces. While its general definition is quite 
broad, we shall suffice with the following special case. 

Definition 4.1 (Cech complex). Let V = {xi,X2, ■ ■ ■} be a collection 
of points in W^. The Cech complex C{V, e) is constructed as follows: 

1. The 0-simplices (vertices) are the points in V. 

2. An n-simplex [xj^, . . . , XiJ is in (7(7^, e) if nfe=o ^<^i^ik) / ®- 

Figure [3] depicts a simple example of a Cech complex in M^. An important 




Fig 3: The Cech complex (7(7^, e), for V = {xi, . . . , xg} C M^, and some e. 
The complex contains 6 vertices, 7 edges, and a single 2-dimensional face. 

result, known as the nerve theorem, links Cech complexes C{V, e) and the 
neighborhood set Upgp Be{p), and states that they are homologically equiv- 
alent. Thus, for example, they have the same Betti numbers. Furthermore, 
both are linked to sublevel sets of the distance function, since it is immediate 
from the definitions that 

(4.1) d^i([0,6]) = {xGM'^: dv{x)<e]= (j B,{p)''°'^=°'''' C{r,e). 

per 

4.2. Critical Points and Betti Numbers. Classical Morse theory, in par- 
ticular the version developed in |7|] that applies to the distance function, tells 
us that, in view of the equivalences in (j4.ip . there is a connection between 
the critical points of d-p over the set d^ ([0, e]), along with their indices, and 
the Betti numbers of C{V, e). As usual, P is a point set in M''. In particular, 
for every critical point of dx„ at height e and of index k, for all small enough 
r], either 

(3k {C{Xn, e + T]))= h {C{Xn, e-ri))+ 1, 



12 BOBROWSKI AND ABLER 

or 

Pk-i {C{Xn, £ + ??))= A-i {C{Xn, e-rj))- 1. 

Despite this connection, Betti numbers, dealing, as they do, with 'holes', 
are typically determined by global phenomena, and this makes them hard 
to study directly in the random setting. On the other hand, the structure of 
critical points is a local phenomenon, which is why, in the random case, we 
can say more about critical points than what is known for Betti numbers to 
date. 

4.3. Random Cech Complexes. For the remainder of this section we shall 
treat only the random sample Af„, although similar statements could be made 
regarding the Poisson case. Retaining the notation of the previous section, 
and defining /3fc „ := l3k{C{Xn-, i"n)), our aim will be to examine relationships 
between the random variables A'^^^^ and the (3k,n and f3k-hxL- In addition, we 



shall compare our results for Nj^^n to those of 10(] and [ll|] for /3k^m using 
Morse theory to explain the connections. 



In direct analogy to the results of Section [3l IQJjlUl show that the limiting 
behavior of C{Xn,rn) splits into three main regimes, depending on the limit 
of nr!^. In the subcritical (rar^ — >■ 0) or dust phase, in which the Cech complex 
consists mostly of small disconnected particles and very few holes, Theorem 
3.2 in [IQ] states that for 1 < A; < d - 1, 

lim(n'=+M(^+i))-iE{/3fc,„} = Dfc, 

n— >cx) 

for some constant Dk defined in an integral form and related to the 7^ of our 



Theorem 13.51 In 1]| the subcritical phase is explored in more detail, and 
limit theorems analogous to those of Theorem 13.31 are proved. Combining 
their results with those in Section [3T| observe that the A^'^^^ and the (3k-i,n 
exhibit similar limiting behavior, and are 0{n'^~^^r^'^). Furthermore, we can 
summarize the relationship between the different A''^^^ and /3k^n as follows: 

iVl,n > N2,n > A^3,n > • • • > A^fc„n 
/3l,n > /32,n > • • • > /3fc,-l,n, 



where ~ means 'same order of magnitude' and kc is as in (j3.3p . For k > kc 
all terms are zero with high probability, which, as before, grow with k. 

Recall that Morse theory tells us that each critical point of index k con- 
tribute either +1 to (3k,n or —1 to /3fc-i,n- Splitting Nk^n accordingly as 
Nk,n = Nj^n + ^kn^ *^^ diagram implies that iV^^ > iV^^. In other words. 
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most of the critical points of index k destroy homology generators rather 
than create new ones. 

For the other regimes, making statements about the Cech complex be- 
comes extremely difficult, and thus the theory is still incomplete. 

In the critical phase {nr'^ — )• A G (0, oo)), the Cech complex starts to 
connect and the topology becomes more complex. In addition, once A passes 
a certain threshold, a giant component emerges (cf. Chapter 9 of J17f|). 
from which comes the alternate description of this phase as the 'percola- 
tion phase'. Theorem 4.1 in fl(}] states that for 1 < k < d — 1, 



n— >oo 



lim n-iE{A,„}G(0,oo), 



although the exact limit is not computed. This agrees with the results in 
Section [32]of this paper. The main difference between the two sets of results 
is that for critical points we are able to give a closed form expression for the 
limit mean of N^^n (Theorem l3.5p . as well as stronger limit results (Theorems 
l3.7H3.9p . This will be useful below, when we discuss Euler characteristics. 

In the supercritical regime {nr!f^ — ?■ oo) even less is known about the 
Cech complex. In general, the Cech complex becomes highly connected, the 
topology becomes simpler and the Betti numbers decrease. Theorem 6.1 of 
[lO| gives the precise results that if / is a uniform density with a compact 
and convex support, and lim„_j.oo(logn/n)~^''^r„ > , then 

(4.2) lim P (/3o,n = 1, /3l,n = • • • = f3d-l,n = 0) = 1, 



which is described in [ll|] by saying that C{Xn,rn) is "asymptotically almost 
surely contractible" . We have no analagous result about critical points, nor 
could we, since Nf^^n is 0{n) and thus A^^^^^ — )• oo (Section 13. 2p . However, 
Corollary 14.21 below gives information about the Euler characteristic of the 
Cech complex which is different from, but related to, (j4.2p . (Note that (j4.2p 
requires that the underlying probability density is lower bounded with con- 
vex support, the same assumption we adopted Section [3^21 ) 

To conclude this section, we present a novel statement about the Cech 
complex C{Xn, rn) which can be made based on the results in Section[3l The 
Euler characteristic of a simplicial complex S has a number of equivalent 
definitions, and a number of important applications. One of the definitions, 
via Betti numbers, is 

oo 

(4.3) x{S) = Y.{-lfPk{S). 

k=0 

However, xi'S) also has a definition via indices of critical points of appro- 
priately defined functions supported on S, and this leads to 
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Corollary 4.2. Let Xn be the Euler characteristic of C{Xn-,fn)- Then 

{1 nr^ — ^ 0, 

l + ELi(-l)'7fe(A) nrj^ ^ AG (0,oo), 
nrj^ — )■ oo. 

Moreover, when nr'^ — t- oo and nrf^ > D* log n (see Proposition \3.9\) . then 

Note that ()4.4p cannot be proven using only the existing results on Betti 
numbers, since the values of the limiting mean in the critical and supercrit- 
ical regimes are not available. This demonstrates one of the advantages of 
studying the homology of the Cech complex via the distance function. 

In closing we note some of the implications of Corollary 14.21 In the sub- 
critical phase, we have that Xn ~ n, which agrees with the intuition devel- 
oped so far that, in this range, the Cech complex consists of mostly small 
disconnected particles and very few holes. In the critical range we have a 
non-trivial limit resulting from the fact that the Cech complex has many 
holes of all possible dimensions. In the supercritical range, Xn ~ 1 which 
is exactly what we get when /3o,n = ^,Pi,n = ■■■ = fid-i,n = (cf. (14. 3p . 
(j4.2p ). Finally, since n~-^E{x„} — )■ in this regime, it is clear now why the 
numerics of Figure [2] showed that '}2k=Q{~^)'^1k{oo) w 0. 

5. Some Notation and Elementary Considerations. The remain- 
ing sections of the paper are devoted to proofs of the results in Sections [3] 
and m and are organized according to situations: sub-critical (dust), crit- 
ical (percolation), and super-critical. In this section we list some common 
notation and note some simple facts that will be used in many of them. 

• Henceforth, k will be fixed, and whenever we use y, y' or 3^j we implicitly 
assume that |3^| = \y'\ = |3^j| = k + 1, unless stated otherwise. 

• Usually, finite subsets of M'^ will be denoted calligraphically {X ,y). How- 
ever inside integrals we use boldfacing and lower case (x, y). 

• For X e M'', X E (M'^)''+^ and y € {W^Y , we use the shorthand 



/(x) 

f{x + r„y) 

/^(0,y) 



f{xi)f{x2)--- f{Xk+l), 

f{x + r„yi)/(x + r„y2) ■■■ f{x + r„yfc), 
/i(0,yi,...,2/fc)- 



The symbol 'c*', denotes a constant value, which might depend on d (am- 
bient dimension), / (the probability density of the samples), and k (the 
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Morse index), but on neither n nor r„. The actual value of c* may change 
between and even within lines. 
• While not exactly a notational issue, we shall often use the facts that, for 
every k, 'n~^(^ — )• l/k\ as n — )• oo, and there is a c* such that (^) < c*!^}^ . 

Lemma 5.1. Let X = {Xi, . . . ,Xk) be a set of k i.i.d. points in M'^ 
sampled from a bounded density f. Then there exists a constant c* such that 

P (Af is contained in a ball with radius r) < c*r ' ^ ' . 

Proof. If X is bounded by a ball with radius r, then X2, ■ ■ ■ , Xf^ are all 
within distance 2r from Xi, thus 

/ \ ^~^ 

P {X is bounded by a ball of radius r) < / f(y)dy f{x)dx 

Jm.'' \JB2r{x) J 

< [ {fm..^o\{B2r{x)f-^ f{x)dx 



where /max := sup^g]gd /(x), and uj^ is the volume of the unit ball in M . D 

6. Means for the Subcritical Range (nr^ — )• 0). We start by prov- 
ing Theorem 13.11 (the limit expectation), which requires the following im- 
portant lemma. Note that the lemma has two implications. Firstly, it gives 
a precise order of magnitude, with constant, for the probability that k + 1 
points in the r„-neighborhood of a point in Xn generate an index-fc critical 
point. Secondly, it imphes that if an additional, high density set of Poisson 
points is added to the picture, the probability that any of these will be in the 
ball containing the A; + 1 original points is of a smaller order of magnitude. 

Lemma 6.1. Let y C Xn, be a subset (chosen in advance) of k + 1 
random variables from Xn, and assume that y is independent of the Poisson 
process Vn- Then, 



hm r-*E{/i,j3;)}= hm r-*E{5.„(3^, ^"0} 

lim r-*E{5,„(3^,3^UP„)} = (A: + l)!^fc. 



Proof. Note that from the definition of /ie(-), it follows that 
/i,(x, X + ey) := he{x, x + eyi, . . . , x + eyu) = /ii(0, y). 
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Thus, using the change of variables x — >• (x, x + r„y), 

^{hr„{y)}= I /(x)/l,„(x)(ix 

7(Md)fe+i 

= rf / fix) fix + r„y)/ir„ (x, x + r„y)(iydx 

(6.1) =rf//(x)/ /(x + r„y)/ii(0,y)dydx. 



Now, for /ii(0,y) to be nonzero, ah the elements yi, . . . ,yk G K must lie 
inside -62(0) - the ball of radius 2 around the origin. Therefore, 

|/(x + r„y)/ii(0,y)| < fL.'^B^wiyi) ' ' ' '^B^miVk), 
and applying the dominated convergence theorem (DCT) to ()6.ip yields 

(6.2) lim / fix + rny)hiiO,y)dxdy = f\x)[ /ii(0,y)dy, 

from which follows 

(6.3) lim r-'"'E{hr„iy)} =[ f^+\x)dx [ /ii(0,y)dy = (fc + l)!^^, 

completing the proof for /ir„(3^)- For 5'r„(3^5'^n) we have 

E{grAy,Xn)} =E{E{grAy,Xn) \ 3^}} = E {hr^yK^ - PiyT''''} , 

where p(>') = f^^^y^ fiz)dz (5(3^) is defined in ([23])). Thus, 

(6.4) E{5.J3^,^„)}= / /(x)/i,„(x)(l-p(x))"-'=-idx 

y(Rd)fe+i 

= ^f / /(^) / /(x + r„y)/ii(0,y)(l-p(x,x + r„y))"-^-idydx. 



The integrand here is smaller or equal to the one in ()6.ip . therefore we can 
safely apply the DCT to it. To find the limit, first note that 

npix,x + Tny) = n / fiz)dz 

J B{x,x+r„y) 

= nVol(S(x,x + r„y)) ^^(-'-+'^"y) ^^'^"^^ 



nuJdirnRiO,y)) 



Vol(S(x,x + r„y)) 

d lB{x,x+r„y) fi^)^^ 

Vol(5(x,x + r„y))' 
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Applying the Lebesgue differentiation theorem yields 

iim /^(;;-r"'^^^'^t =/(^)- 

n-i^oo Yo[[B[x, X + r„y)) 
Therefore, since nr'f^ — )• 0, we have 
(6.5) lim np{x, x + r„y) = 0. 

n— j-oo 

Thus, it is easy to show that 

lim (1 - p{x, X + r„y))"-'=-i = 1, 

n—^oo 

and using ()6.4p and (|6.3p yields 

lim r-*E{5.„(3^,Af„)} = lim r-*E{/i,„(J)} = (A: + 1)!^^. 

ra— )-oo n— )-oo 

Finally, the definition of P^ as a Poisson process with intensity nf{x) implies 

^{gr^{y,yUVn) I 3^} = hrSy)^{B{y)r\Vn = I 3^) = /ir„(3^)e-"P(^). 

Thus, 

E {5r J3^, 3^ U P„)} = E {E {<7,„ (3;, 3; U P„) I 3;}} 

= ^f / /(x) /" /(x + r„y)/ii(0,y)e-"P(^'^+'-"y)dydx, 

Applying the DCT as before, and using (j6.5p and (j6.3p . yields 

lim r-*E{5.j3^,P„)} = lim r-*E{/i,„(3^)} = (A; + l)!/ifc, 

n— >oo n— >cxD 

and we are done. D 

Using the previous lemma, it is now easy to prove Theorem 13. 1[ 
Proof of Theorem I3.1[ First, note that 

Nk,n = ^ gr.a{y,Xn), 

yaXr, 
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where the sum is over all the subsets of size A; + 1. Therefore, 



E{iVfe} = Yl ^{9rAy,^n)} = ( ,^ ': , ]E{gr,SXk+l,Xn)} 



ycXn 



n 
k + l 



Using the fact n ^'^^^'(^^-^ — ?• t^tytt together with Lemma [6TT1 yields 



(fc+i) 
lim(n'=+Vf)-iE{iV,} = ^fc, 

n— >oo 



as required. As for the Poisson case, note first that Nk^n = YlycVn 3"^^ (3^) ^n 
Applying Theorem lA.ll therefore yields that 



E 






{k + l)\ 
where y' is a copy of y independent of Vn- Lemma |6 . 1 1 then implies 

lim(n'=+Vf)-iE|iVfc,4=/«fc, 
as required. D 

7. Variances and Limit Distributions for the Subcritical Range. 

The proofs of Theorems l3.2l and l3.3l split into three different cases, depending 
on the limit of n^^^r'^l^. 

7.1. n "'"^r„ —7- 0. We start with the limit variance for this case. 

Proof of Theorem 13. 2[ 



^{Nln]=E< 



yi(zx„y2(zx„ 



Note that 



= Y.^\ E E 9rSyi,Xn)grAy2,Xn)l{\yir\y2\=j] 

k+l 

:=EJEU.}- 
j=0 

h+l = E 5'r„(3^1,'^n) = Nk^n- 

yicx,. 
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Thus, from Theoreni l3.lt 

(7.1) hm(n'=+Vf)-iE{4+i} = /ifc. 

n— ^-oo 

Next, for < J < /c + 1, if 13^1 n^^al = j and 5r„(3^i,^n)5rj3^2,^n) = 1, 
then necessarily the 2k + 2 — j points in 3^i U 3^2 are bounded by a ball of 
radius 2r„, and using Lemma l5. II we have 

^^^^■^ = G + l) (I + 1 I ]) C i 0^^^-(^i"^")^-(^2"^")>l^in3^.|=, 

Thus, 

(7.2) (n^'+Vf )-iE{/j} < c*(nr^)'=+^--'' ^ 0. 

For j = 0, the sets 3^i and 3^2 are independent, and since 5r„(3^i;'^n) < 
^r„(3^j), we have 

E{5r„(3^1,^n)5rj3^2,^n)} < E { /l,„ (Ji ) /l,„ (3^2 ) } = (E { /l,,„ (3^i ) } ) ^ . 



Therefore, 



K{M = ( . , J ( . , . )E{5rj3^i, A'„)5.„(3^2, A'„)}|3;^n3;,|=o 



n \ /n — A; — 1^ 

<cV(^'+i)(E{/i,J3^)})2. 



Using Lemma |6. 1 1 together with the fact that n '^ rf^ — t- yields 
(7.3) (n^'+VfriE{/o} < c*n^+Vf (r-*E{/i,j3^)})' ^ 0. 

Combining ([TT]), ([T^D, and ([73]) yields 

lim(n^+Vf)-iE{iV2„}=Mfc. 
In addition. Theorem 13.11 implies 

(n'^+Vf )-i(E{iV,,4)2 = n'^+Vf ((n^+Vf )-^E{iVfc,„})' ^ 0. 
Therefore, since varjA'^fc^^} = EA''|^ — (EA^^fc^^)^, we conclude that 

lim(n'=+Vf)-iVar(iV,,„)=/.,, 
n— >-oo 
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which gives Theorem [32] for the random sample case. The proof for the Pois- 
son case (i.e. for N^^n) is similar in spirit, but technically more complicated. 
The main steps of the argument follow. We start by writing 



-{iV2,}=E| Y. E 9rAyi,rn)grAy2,Vn) 



E 

yiCVny2CVn 

fe+1 I 

E^l E Y. 9rAyuVn)gr„iy2,Vn)l{\yiny2\=j} 

j = [yiCPny2CVn 

k+l 



■-T.^{',}' 



j=0 

Again, for j = k + 1 we have 

(7.4) hm (n'^+Vf )~iE \h+A = lim (n'^+Vf )-1e {n^J = f^k- 
For < J < A; + 1 , using Corollary IA.2I we have 

E{/,}=cV=+2-^E{5.„(X,X2UPn)5r„(3^^,X2UPn)}|3;,n3;^|=i- 

Similar arguments to those we used in the previous case then yield that 
lim(n^^r5^ )~^E/j = 0. Furthermore, it is also easy to see that lim(n "''^r*)"^ 
(EA'^fc „)'^ = 0. Thus, we conclude that 

lim (n^+Vf )-iVar (iVfc,„) = ^,k, 

which completes the proof of the theorem. D 

Next, we wish to prove the first part of Theorem 13.31 i.e. that Nk^n -^ 0. 
Proof of Theorem 13.31 - Part 1. Clearly, it suffice to show that 

(7.5) lim E \N?] = lim E \ NL} = 0- 
However, in the previous proof, we saw that 

lim (n'^+Vf )-iE{iV2„} = lim (n'^+Vfr'E {a^m} = /^fe- 
Since n^+^r'^ — )• 0, (j7.5|) follows immediately, and we are done. D 
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7.2. n'^+Vf ^aG (0,oo). 

Proof of Theorem I3.2[ The proof in this case is similar to the pre- 
vious one, the only difference being in how to bound the terms E {Iq} and 
E{/o}- For that, a proof in the spirit of Lemma [6. II can be used to show that 



hm r-''"'^{grSy^,Vn)grAy2,Vn)]\y,r,y,\=, = {{k + l)\l^kY- 
Therefore, 
lim (n^+Vf )-iE{/o} 

n— >oo 

Similarly, using Corollary IA.2| we have 
Iim(n^+Vf)-iE|/o) 



2fc+2 



hm (n^+Vf )- \ ; E{g,jyi,P„HJy2,7^n)}|3;,ny.|=o 



Finally, we also have 



anl. 



lim(n'=+Vf)-i(E{7V,,„}f = lim(n'=+Vf)-i(E|iVfe,4) = "/^I- 



This completes the proof. D 

Next, we prove the Poisson limit of Theorem 13.31 for which we need 

Lemma 7.1. Denote the total variation norm by c/tv- Then 

1. Let Sk,n '■= 'l2ycXn ^r-niy), o-nd let Z ~ Poisson (E{Sk,n})- Then 

dTV {Sk,n, Z) < c^nri 

2. Let Sk,n '■= J^ycVn ^^n('^)' '^^^ ^ ~ Poisson ( E < Sk^n \ ) • Then 

d-TY (Sk,n,Z] < c*nr^. 
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Proof. The proof is very similar to the proof of Theorem 3.4 in |17i ]. and 
uses the Poisson approximation given in Theorem IB. 2[ 
Part 1: Let X„ = {i C {1, 2, . . . , n} : |i| = A; + 1}. Then, for i = {iq, . . . , i^}, 
and Xi = {Xig , . . . , Xj^. } , we can write 

Set A/i = { j G X„ : |i n j| > 0}, and let ~ be a relation on X„ such that i ~ j 
if and only if j G A/i. For '\ ^ '}, X\ and Aj are independent unless j E A/i. 
Thus, the graph (X„,~) is the dependency graph for ,^i := hr„{X{). 

Now, if hr„{X\) / then the k + 1 points in X\ are bounded by a ball of 
radius r„, and using Lemma l5. II we have 

Pi:=EUi}<cVf. 

Therefore, 



i6X„jeM ^ ^ 



n \ f f n \ /n — k — 1 
k+1 ~ [ k+1 



^*^2dk 



n 

where the last inequality uses the facts that rar^ — >■ and k>l. 

Next, if i ~ j with |i n j| = Z > 0, and hr„{X{)hr„{X^) ^ 0, then necessarily 
the 2A; + 2 — / points in X\ U X-^ are bounded by a ball of radius 2r„, and 
therefore, 

Pij:=E{eiCj}<cV52^+i-'). 

Thus, 



k 



s=*E 



2k+2-ld{2k+l-l) 



1 = 1 

„*^k+l„dkf„^d\ 



Finally, using Lemma l6. II it is easy to prove that 
lim(n'=+Vf)-iE{5fc,„} = Mfc, 
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which iniphes that 



1 



Therefore, from Theorem IB.2I we conclude that 

d^y{Sk,n.Z)<c''{nri). 

Part 2: The proof here rehes on the preceeding one, albeit with additional 
technicalities. We start by conditioning on \Vn\, the number of points in Vn- 



Sk.n G A 



Z £A 



(7.6) 



f2 (^(Sk,n^A\ \Vn\=m)-F(z€A))Fi\Vn\=m 



m=Q 

oo 



< X] 1^ (^fc,n eA\ IT'^I = m) - P (Z G ^) I P {\Vn\ = m) 



m=0 



Given \'Pn\ = w-, using the notation in the proof Lemma |7. 11 we can write 

Setting ^i = hr„{Xi), Pi = E{hr^{^i)} and pij = Ejc^i^j}, then, as in the 
proof of Lemma 17.11 it is easy to show that 



(7.7) 

(7.8) 
(7.9) 



ieXm jeM\i 



2k+2-ld{2k+l-l) 

III/ I ^ . 



1 



1=1 



*f„k+ldk\-l 



E 



|'S'fc,n| 



< c*(n'^+Vr) 



Therefore, from Theorem IB.21 we can conclude that 



(^^, „ e A I \V,] = m) - P (Z G ^) I < c*n-(^+i) Xm2^+2-i^d(fc+i-Z)_ 

1=1 
Substituing back into (|7.6p . we have 

dTV {Sk,n,z) < C^n-'=+lX]^n^'^'"'^^{l^"l''^'''} • 

1=1 
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Since \'Pn\ ~ Poisson (n), it is easy to find a constant c* such that 

e{|P„|''+'-'}<cV'=+2-^ 

for every 1 <l <k. So, finally, we have that 

1=1 
since nr'^ — )• and so is bounded. D 

Note that the previous result did not use on the assumption that n'^+^r^'^ — )• 
a G (0,oo). However, to prove an analagous result for A^^^^^ rather than Sk^n 
we shall need it. We shall also need the following two lemmas. 

Lemma 7.2. Let X,Y be integer random variables defined over the same 
probability space, such that A := X — Y > 0. Then dxv (X, Y) <E, {A} . 

Proof. For every A £M (the Borel sets of M), 

\¥{X £ A)-F{Y £ A)\ = \F{X £ A,X j^Y)-F{Y £ A,X ^Y)\ 

= \F{X ^Y){F{X £ A\X ^Y)-F{Y £ A\ X ^Y))\ 
<F{X ^Y) 
= P(A > 1) 
<E{A}. 

Thus, 

dTY{X,Y) = sup\F {X £ A) -F (Y £ A)\ <E{A} 



and we are done D 

Lemma 7.3. LetX ~ Poisson (A^;) , Y ~ Poisson (Ay). Thend^y iX,Y) < 

I Ax — Ay\. 

Proof. Assume that A^ > A^. Let A ~ Poisson (A^ — Ay) be independent 
of y, and define X ■.= Y + A. Then X ~ Poisson (A^), and so dxv (X, Y) = 
(ixv ( ^) ^ ) • Since A = X — y > 0, it follows from Lemma 17.21 that 



dTY[X,YJ <E{A} = A,-Ay, 
and we are done. D 
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Proof of Theorem 13.31 - Part 2. For a start, we need to prove that 
dxY (^fc,n) 'S'fc.n) ^ c*nr^. To this end, define A := Sk^n — ^k,n and note 
that A counts the number of subsets y C Xn for which hr„{y) = 1 but 
9r„{y,^n) = 0. This implies that there exists X G Xn\y for which X £ 
B{y). Thus A is bounded from above by /c + 2 times the number of {k + 2)- 
subsets contained in a bah of radius r„ . From Lemma 15.11 and Lemma 17.21 
we have 

dTv(iVM.^M) <E{A} <c^(^^^2)^:^^'^'^ <c^(n'=+Vf )(nr:J) <c'^(nr^), 

where we used the fact that n '^^r'f^ is bounded. 

Next, if Zj\f ~ Poisson (E {N^^n}) and Zs ~ Poisson (E {Sk^n}) , then from 
Part 1 of Lemma 17.11 and the triangle inequality, 

dxv {Xk,n, Zn) < d"[Y {Nk^n, Sk,n) + ^TV {Sk,n, Zs) + C^TV {^S, Zn) 
<c*{nrt)+dTw{Zs,ZN). 

Finally, Lemma 17.31 implies that 

dTv(^5,^7v) < |E{S,.,„} -E{iVfc,„}| = |E{A}| < c*{nri). 

This completes the proof that dxv {Xk,n, Zn) < c*{nr'^) — )• 0. From Theorem 
13. H since n^'^^r'^ — )• q, we have that 'E{Nk^n} -^ oi^k- Using the fact that 
Zn ~ Poisson (E {Nk^n})-, it is easy to see that (ixv {^k,n, Poisson (afik)) — ^ 
which implies convergence in distribution. 

The proof for the Poisson case (i.e. Nk^n) is exactly the same, other than 
using Part 2 of Lemma l7.ip rather than Part 1.. D 



7.3. n ~^^r'^ — t- oo. This is the most complicated case. We start by prov- 
ing Theorems 13.21 and 13.31 (variance and CLT) for the Poisson case. Then, 
using "De-Poissonization" (Appendix [C]l we treat the random sample case. 

7.3.1. CLT for the Poisson Case. 

Proof of Theorem 13.21 - Part 3 (iVfc.n only). We start with the sec- 
ond moment of Nk n, 
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fe+1 ( 

= T.^\ E Y. 9rSyl.Vn)9rAy2.Vn)^{\ylr^y2\=3} 

fe + 1 

j=0 

As in the proof of Theorem 13.21 we have that 

hm (n*^+Vf )-^E4+i = Hk, hm (n'^+Vf )-1E/,- =0, 1 < j < k. 

n— >oo n— >oo 

However, in this case, Iq requires a different treatment. Recall that our 
interest is in the variance - Var ( Nk,n ) • So we have, 



Var (iV2,) = E [nQ - (e {iV,,„})' 



= E{4+i}+^E{/,} + fE{7o}-(]E{iVfc,„})' 
j=i V 

Thus, to complete the proof, we need to show that 

^lim (n^+Vf )-i (e{/o} - (E{iVfe,„})') = 0. 

Aplying Corollary IA.2I we have 

-,fe+i 



E 



r-1 / n''+^ \ 



where y[ and y!^ are sets of i.i.d. points with density /, independent of Vn, 
and y'12 = 3^1 U 3^2- Similarly, applying Theorem I A. H we have 

Nk,n} = Jj^^r^^{9rAy'l,y'l^Vn)} . 

Therefore, we can write 

(E{Nk,n}y = (^j^^^ K{9rAyiyi^rn)grAy2,y2^K)} , 
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where V'^ is an independent copy of Vn . Set 

Showing that n^^r^ E {A} — t- wih complete the proof. Set 

Ai = A • 1 {B{y[) n Biy'^) / 0} , A2 = a • i {b(X) n B{y!^) = 0} . 

If Ai 7^ then ah the elements in y[ and y!^ are bounded by a ball of radius 
2r„. Therefore, using Lemma |5. II 

E{Ai}<cV5^'=+i). 

Next, note that 

A2 = hrAy[)hrAy2n {^(X) n B{y!,) = 0} 

'1 {T'n n S(X) = 0} 1 {T'n n B(3^^) = 0} 

-i{Vnn B{y[) = 0} 1 {p; n B{y!,) = 0} ) . 



If A 2 7^ 0, then B{y[) and -6(3^2) ^^^ disjoint. Therefore, given J'J and 3^2) 
the set P„ n 5(3^2) is independent of the set Vn H -6(3^0 (by the spatial 
independence of the Poisson process), and has the same distribution as V^t^ 
B{y'2). Thus, E{A2 I 3^(,3^2} = 0> which implies that E{A2} = 0. 
To conclude, E{A} < c*r„^ '. Therefore, 



lim 71^=+ V-'^'^E {A} < lim c%nr'^f+' = 0. 

n— >oo n— >oo 

This completes the proof for the limit variance. D 

Next, we wish to prove the CLT in Theorem | 



Proof of Theorem 13.31 - Part 3 {N^^n only). The proof is based on 
the normal approximation for sums of dependent variables given by Stein's 
method (Appendix [B|). We start by counting only critical points located in 
a compact A C M"^ for which /_^ f{x)dx > 0. For a fixed n, let {Qi,n}jgN ^^ 
a partition of W^ into cubes of side r„ , and let /a C N be the (finite) set of 
indices i for which Qj^„ n ^ / 0. For i G Ia, set 

(7.10) 9^}iy,Vn) := grAy,Vn)lAnQ.JC{y)), 



where C{y) is the critical point in M.'^ generated by y (cf. ()2.2p ). That is, 

era 



Qrl = 1 implies that y generates a critical point located in ^4 n Qi^n- Then 



ycVn 
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is the number of critical points inside A n Qi,„, and 

^k,n •= # {critical points of dp^ inside A} = Yj ^kli- 

ieiA 

First, as in the proof of Theorem 13.21 one can show that 
(7.11) f,,{A) := hm (n'^+Vf )-iVar (n^\ e (0,oo) 



Now, for i,j S Ia, define the relation i ~ j if the distance between Qj^„ and 
Qj^n is less than 2r„. Then (/a,~) is the dependency graph (cf. (jB.ip ) for 

the set < ivy"' > . This follows from the fact that a critical point located 

inside Qi^n is generated by points of Vn that are within distance r„ from 
Qi^n (along with the spatial independence of Vn)- The degree of this graph 
is bounded by 5 . Consider the normalized random variables 






^il-^- 



Var(iV,^„))'^' 

According to Theorem IB. 31 in order to prove a CLT for N^^, all we have to 
do now is to find bounds for E {|^j|^} , p = 3, 4 . 

Let Br„{Qi^n) C U.'^ be the set of points within distance r„ of Qi,„, and let 
Zi := iVn n Br„ iQi,n)\ be the number points of the Poisson process Vn lying 
inside Br„{Qi^n)- Then Z, ~ Poisson (Aj) where Aj = f^ ,q ■.nf{x)dx < 

"'/max(3r„)'^. Thus, Zi is stochastically dominated by a Poisson random vari- 
able with parameter c*nr'^. Now, 

m < M* ^ < c^z'^+K 

Therefore, for any p > 1, 



E 



{|iV« [} < c*K{zf^'^} < c^nrtY^^^^) < c*{nrtY+\ 



since nr^ is bounded (note that each of the c*'s stands for a different value). 
Thus, it is easy to show that also 



E{i^^a-^{^M}r}^-^( 



nrtY+^. 
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Since A is compact, there exists a constant v such that |/yi| < vr^ . There- 
fore, for p = 3,4, 



i€lA 



d^'k ( ly-tryiO, \fZ-\-L 






p/2 



^c*(n'=+Vf)i-P/2 






p/2 



where we used the fact that n^^r'ff — t- oo and the hmit in Theorem 13.21 
From Theorem IB .31 we conclude that 



(7.12) 



nL 



E 



{^t} 



VarfiV,^„ 



1/2 



AA(0,1). 



Now that we have a CLT for N^^, we need to extend it to one for Nj^^^- 
The method we shall use is exactly the same as the one used in p/3], but, 
for completeness, we nevertheless include it. 

Set Am = [-M, M^, A'^^ = W^XAm, and suppose that M is large enough 
such that /^ f{z)dz > 0. Set 



UA) 



nL 



E 



«} 



(^fc+l^dfc)l/2 



Cn 



Nk, 



E 



{Nk,n} 



(^fc+l^*)l/2 



To complete the proof we need to show that |P(Cn ^ t) ~ ^{'t/^/Jj^)\ -^ 
0, where $(•) is the standard normal distribution function. Clearly, (^„ = 
Cn{AM) + CniA^), and from (|7T2]) we have that 

(7.13) Cn{AM)^M{0,IJ.kiAM)). 

For every t E M and M, 5 > we have 

|IP(Cn<t)-$(t/x/M^)| < \P{Cn<t)-F{Cn{AM)<t-6)\ 

IP (UAm) <t-6)- mt - 6)/^fik{AM)) 



(7.14) 
Now, 

P(Cn<i) 



+ 



' (Cn(^Af ) <t-5,Cn<t) + F HUAm) -t\<5,Cn< t) 

+ f{Cn{AM)>t + 6,Cn<t). 
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Note that the first term equals 

P (Cn(^M) <t-6)-F (Cn(^M) <t-6,Cn>t). 



Thus, 



' (Cn < i) - P (CniAM) <t-6)\<F (UAm) <t-5,Cn>t) 
+ P (ICn(^M) -t\<6,Cn<t)+F {UAm) >t + 5,Cn<t) 



< 



{\UA^')\>6)+F{\Cn{AM)-t\<S). 



From Chebyshev's inequahty we have that P {\CniA^^) \ > 5) < J^^Var (Cn(^*^)) • 
From (I7.13p . we have that 



hm P {\UAm) -t\<6) = $((t + d)/^/JIi^{A^) - mt - 6)/y/fikiAM)) 

n— 7>oo 

25 

~ y^2TTfik{AM) 

Therefore, 

hm sup |P (Cn < t) - P (Cn(^M) < t - <5)| < ^^^V^ + 



S^ y'27Tfik{A 



M) 



For e > 0, choose 5 = ey^WJIk/^- Since hmAf^oo fJ-kiAn) = fJ-k, and hmM^oo ^fc(^^) 
0, there exists M large enough such that /ifc(ylAf) > /ifc/2, ^^(A*^) < e(5^/2, 

and also $ ((t - 5)/y^/Xfc(^M)) - $ [t/^/JJ^) < 2e. For this choice of 6, M, 
using last displayed inequality, we have 

limsup |P (Cn < t) - P (CniAM) < t - S)\ < 6. 
n— >oo 

Finally, returning to (|7.14p . there exists A^ > such that for every n > N 

|P(Cn<t)-^(t/^/7^)|<4e. 
This completes the proof. D 

7.3.2. CLT for the Random Sample Case. We shall now return from the 
Poisson case to the random sample one. Our argument will be based on the 
De-Poissonization of Theorem IC.li 

Proof of Theorems 13.21 and 13.31 (A''fc„). Let Dm,n denote the incre- 
ment: 
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In other words, Dm,n is the change in the number of critical points, as we add 
a new point to our fixed-size set. Let 7 be an arbitrary number in (1/2, 1). 
We wish to apply Theorem [Cll with Hn{Vn) = {nri)-^/'^Nk,n and a = 0. 
Thus, we need to prove the following: 



(7.15) 
(7.16) 
(7.17) 



lim sup 

n^oo „_„7<m<n+n^ 



lim sup 

"~^°° n-nT<m<m'<n+nT 

lim sup n 



nri)-^/^¥.{D.m,n} =0, 
(nr^)-'=E{Z)„,„Z)„,,„} 
V2(nr^)-^E{l)i,„}=0. 



0, 



Considering only the cases where gr„{y^'^m) 7^ 9r„(3^>'^m+i)j we can 
write Dm,n = D+^^n - ^m,n' where 

^m,n = #{y ^ Xm+l ■ 9r„(3^, ^m+l) = 1 and Xm+1 G 3^} , 

^™,n = #{ycXra: 5rj3^, ^m) = 1 and X„+i G 5(3^)} . 
We now prove (TTTSll - ffTnTl) . starting with (fTH^ . Note that 

We shall show that the supremum over each of the terms goes to zero. From 
the definition of D^ „ we have that 



"^yigrAy^Xm+l)} < (/'\'''')^{hrAy)}. 



Thus, using Lemma 16.1 



lim sup inrir''/^E{D+J 



< lim (nr^)'^/2 

= 0. 
From the definition of D^ „ we have 

m 



" H''\''']][r;mhrAy)} 



^{D-.,n] = (^^^ jE{5.„(3^,A'^)lB(y)(X„+i)} 

^^ jE{/l,„(3^)l5(3;)(X„+i)} 
<cV + lE{/i,„(3^)l5(3;)(X„+l)}, 
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for some constant c*. Now, 

K{lB(y)iX^+l) \y}= f{x)dx < U^^ujdrt 

JB{y) 

which imphes that 'E.[hr^{y)l.B(y){Xm+i)] < c*r^lE{/ir„(3^)}, and so 
hm { sup {nrir^/^¥.{D;^^^]] 

< hmc^(nr^)'=/2+i(r-*E{/i,j3^)}) 
= 0. 

This proves (j7.15p . To prove (j7.16p we need to show that 



hm sup 

n->oo \n-n-i<m<m'<n+n'y 



{nr<i)-''E{D^^nDm>,n} 



0. 



RecaU that Dm,n = D^^^ — -D^„. Thus we can write Dm,nDm' ,n as a sum 
of four different product terms. We start by looking at the term D+ „-D^/ „• 
Recalhng the definition of D^ „ , we can write this as 

^m+i&y ^m+i&y 

Thus, 

(7.18) e{d+„Z)+,„}< y. E ^{KXy)KAy)]- 

Xm+ieyx^,+^ey 

Now, if \yr^y'\ = j > O, and hrSy)hrAy') = 1, then 3^ U y must be 
bounded by a bah of radius 2r„. This set contains 2k + 2 — j points, so that, 
by Lemma |5.H we have 

If 3^n 3^' = 0, then the two sets are disjoint and independent. Each consists 
of /c + 1 points and must be bounded by a bah of radius r„. Therefore, 

^{hrAy)hrAy')] = {^{hrsy)}? < c^'ri^'. 
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Applying these bounds to (|7.18j) yields 



fc 






^2k-j^d{2k+l~j) 



where we emphasize that each of the appearances of c* represents a different 
value. Multiplying by (nr^)"'' and taking the limit, we obtain 

lim sup (nr;^)~'=E{z)+„i?+ 1=0. 

To handle Z)~ „ recall its definition and write 

ycXm 

< E /ir„(3^)lB(3;)(^m+l). 

Thus, 

(7.19) e{z)-,„D-,„} 

If Xm+1 e y, and 13^ n y\ = J > 0, then yuy'U {X„+i, X„/+i} consists 
of 2/i; + 3 — J points, and for the expression inside the expectation to be 
nonzero all the points must be contained in a ball of radius 2r„. Thus, by 
Lemma 15.11 

E{hr„iy)hr„iy')lBiy){Xm+l)lBiy){X^m'+l)} < C^ ri^^'^'- ^\ 

If X^+1 y, and |3;n y| = i > O, then the set 3^uy U{X™+i,X„/+i} 
consists of 2k + 4 — j points, and therefore, 
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If j = 0, however, then the sets y U {Xm+i} and y' U {Xm'+i} are disjoint 
and independent, each containing A; + 2 points. In addition, we need each of 
this sets to be contained in a ball of radius r„. Therefore, 

^{hrAy)hrAy'nBiy)iXm+l)lBiy')iX„.'+l)} < C^T^^^^^l 

Substituting the above into (J7.19P we have 

k 



- {-™,..^.,„} < ^*E (,: (";-_v ') {' ; 'v?---. 



n 



-*|(;:0(l-::;)Cr''*"^"' 



k 

Effl -t- ril\ f Ti -I- T)^ 
\k^\ )\k-i 



+ n^^^|^n + n^^^^^(2fe+2_,) 



+ ^*E 



^n + n'^\ f n + n^ 



,=0 .A: + Infc + 1-J7 " 



d{2k+2-j) 



k k+1 

i=0 j=0 



2k+2-j d{2k+2-j) 



From the above we can conclude that 



lim sup ("-J^f) ''IE 

"~^°° \n-n'-i <m<m' <n'^n'-i 



{^«.,n^™^n}j=0. 



We shall stop with the computations here. The convergence of the cross- 
products (i.e. D^ n^m' n ^^"^ ^m n^m' n) *^^^ ^^ showu using similar tech- 
niques, and these will prove (|7.16p . The proof of (j7.17p is also very similar. 
Finally, the last condition in Theorem IC.ll requires that 



Hn{X^) = {nri)-'''^ Y, 5r„(3^,A'^)</3(n + 



m] 



y<zx„ 



for some /3 > 0. Using that facts that "^ZycXT^. d^niy i '^m) < (fc+i)' "''"n ~^ 0, 
and n '^^rfl^ — )• oo, we have 

Thus, taking (3 = max(c*,A; + 2) completes the De-Poissonization proof. 
Consequently, we have that both Theorem 13.21 and Theorem 13. 31 hold for the 
fixed-size case as well. D 
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8. The Critical and Supercritical Ranges {nr'^ — )■ A G (0, oo]). 
We start with the expectation computations. The following standard lemma 
is going to play a key role in the supercritical regime. 

Lemma 8.1. Let D CR be a compact convex set with positive Lebesgue 
measure, and let Br{x) C M'^ be the ball of radius r around x. Then there 
exists a constant c* such that for every r < diam(D) and x G D, 

Yol{Br{x)nD) >c*r'^. 

The following Lemma is analogous to Lemma |6. 11 



Lemma 8.2. Let y C Xn, be a set of k + 1 random variables from X^, 
and assume that y is independent of the Poisson process Vn ■ Then, 



hm n'^^igrSy.Xn)) = lim n'=E{5,„(3^,3^ U P„)} = {k + l)\^kW- 



Proof. We shall show the full proof for the Poisson case {gr^ (3^, 3^UP„)). 
The proof for the random sample case is similar. Setting s„ = n~^''^ and 
mimicking the proof of Lemma 16.11 we obtain 

E{5r„(3^,3^UP„)}= / /(x)/i,„(x)e-PWdx 



S 



dk 



fix) fix + Sny)hrAx,x + s„y)e-'^P("'"+^"y)dy(ix 



(8.1) =n-'' [ fix) [ /(x + s„y)/i,„(0,y)e-"P(^'^+^"y)dy(ix, 

where t„ = rn/sn = n^'^^rn- We wish to apply the dominated convergence 
theorem for the last integral. Thus, we need to bound the integrand with an 
integrable expression. 

In the critical range this is done much as in the subcritical range. Since 
nr'^ — )• A < oo, we have that r^ is bounded by some value M. Now, for 
^r„(0, y) to be nonzero, all the elements yi,...,yk G K'^ must lie inside 
S2r„(0) C S2m(0). Therefore, 



fix + s„y)/i.„(0,y)e""P^-'-+^"y^ < /LxlB.M(o)(yi) • • • ^B.^oM), 

and this expression is integrable. 

The last argument cannot be applied in the supercritical range since r„ 
is no longer bounded. This is where we use our additional, lower bounded. 
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assumptions on the /. Since we now have /min > we also have 

p(x) = [ f{z)dz > /^in Vol(i?(x) n SUpp(/)). 

Jb{x) 

If hr^{x) / 0, then necessarily C(x) G conv°(x) and -R(x) < r„ (cf. (|2.6p ). 
In addition, if /(x) / 0, then x C supp(/). Since we assume that supp(/) is 
convex, we have that C(x) € supp(/) as well. Thus, B{x) is a ball centered 
at C(x) G supp(/), with radius -R(x) small enough, and Lemma [8TT] yields 

Vol(5(x) n supp(/)) > c''i?'^(x). 

This can be used to bound the integrand in (|8.1|) . so that 

(8.2) 

= fk p-/minC*iJ'*(0,y) 

J max^ 

Next, note that for i = 1, . . . , A;, i?(0,y) > ||yi|| /2. Thus, 



which implies that the expression in (18.2p is indeed integrable, and so the 
DCT can be safely applied in both regimes. 

Next, we compute the limit of the integral in (18. ip . Note first that 



np{x, X + Sny) =n f{z)dz 

' B{x,x+s„y) 



Yol{B{x,x + Sny)) 
( udrn \\ JBix,x+s„y) /(^)^^ 

nuJd{snR (0,y)) 



^dR^O^y) 



Yol{B{x,x + Sny)) 

JB{x,x+s„y) fi^)dz 



'Yol{B{x,x + Sny))' 
and using the Lebesgue differentiation theorem yields 

lim np{x,x + Sny) = a;di?'^(0,y)/(x). 

n— >-oo 

Taking the limit of all the other terms in ()8.ip we have 
limn>'E{gr„{y,yuVn)}= [ f+\x)hr^{0,y)e-'^'^'''^'>'y^f^^Uydx, 
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where Too = lim„^ooTn. In the supercritical regime, Too = oo, and conse- 
quently /iroc(-) = ^oo(-) = h{-). Thus, 

= (A; + l)!7fe(oo). 
In the critical range, t„ — )• A^''^. Therefore, 

= A'^ / f''+\x)h^r/40, A^/'^z)e-^^''^'(°'^)^(^)dzdx = {k + l)!7fc(A). 

This completes the proof. D 

8.1. Asymptotic Means. Using Lemma 18.21 we can prove Theorem 13. 5i 
Proof of Theorem 13. 5[ For the random sample case we have 



and, using Lemma 18.2 



hm n-'E{Nk,n}= hm fn^^'+'^f, "" ,)) Ue {gr„iy , Xr,)}) = jk{\) ■ 






For the Poisson case, using Theorem I A. H 

and, using Lemma W?2\ 

lim n-^E|iVfc,„|=7fc(A), 

which completes the proof. D 

8.2. Asymptotic Variance - Poisson Case. For the variance and CLT 
results, as in the subcritical phase, we shall first treat the Poisson case. 
Then, using De-Poissonization we shall turn to the random sample case. 
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Proof of Theorem I3.6L {Nk^n only). As in the proof of Theorem [3^ 

Var (^NQ=E{Nk,n} + J2^{Tj} + (e{/o} - (e {iVfc,„})'] , 
where 

E E 9rAyi,Vn)9rAy2,Vn)l{\yiny2\=j}}. 



I,=E 



^4 = ,!((fc + l-,)!)2 ^{g^^(y^yi2U^n)g.„(y^,X2UPn)}|y,,y,|,^. 



From Corollary lA. 21 

For < J < fc + 1 , as in the proof of Lemma 18.21 one can show that 
lim n^(2fc+i-,)jE {5^JX,X2 U Vn)9rAy'2,y[2 U Pn)}|yny I-,- 
= I f''^'-'{x)hr^ (0, yi U Z)hr^ (0, y2 U z) 

Rd(2fe+2-j) 

where x G M'^, y^ G M'^('=+i-^'),z G M'^(^-i), and t^o = lim^^oo^^'^'^^n- 
Therefore, 

Jimn-E{/,}=,i^)(A), 

where 

7i^'^(A) := ^,((^^,_^.),)2 y /^'=+^-^(x)/.i(0,yiUz)/.i(0,y2Uz) 



for A G (0, oo), and 

Mk+i-jW 



^k^ (°°) ■= -uTTTl ^ / f^'-'ixMO, yi U z)/j(0, ys U z) 



;(2fc+2-i) 

-Vol{B(0,yiUz)UB(0,y2Uz))/{a;)^ 



^ g- vol^^^u,ylUZJU^^u,y2UZJ;J^.;;^^^y^^y^^^_ 

It is easy to show that < 7^. (A) < oo for A G (0, oo]. For j = 0, we define 

A:=grAyiy[2^Vn)gr„iy2,y[2^Vn)-grAy'i,y'i^Vn)grAy2,y2^K) 
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so that 

wlT^X—fwJAT, W — , 

((fc + l)!)2 



((- ^ "I \ 2 „2A;+2 



Now set 

Ai = A • 1 {s(X) n B{y!,) / 0} , A2 = A • 1 {5(X) n B{y^) = 0} . 

Then, as m the proof of Theorem 13.21 we can show that E {A2} = 0, and 
hm n2^+iE{Ai}= / f''+\x)hrA0,yi)hr^,{0,y2)l{B{0,y^) n B{z,y2) ^ ^ 

n-i-oo J^d{2k+2) 

^ L-yoliB{0,yi)UB{z,z+y2))f{x) _ g-<^d(fi''{0,yi)+R'^(0,y2))/(x)\ dxdzdyidy2, 

where x,z e R'^, and y^ e (R'^)''. Thus, 

Jim n-i TeI/o} - (e {^m})') = 7r(A), 
where 

. s \2k+l 

7r(A): ' 



((fc + l)!)2 
X / /2'=+2(x)/ii(O,yi)/ii(O,y2)l{i?(O,yi)ni3(O,z + y2)/0} 

for A < 00, and 

X / f''+\x)h{0,y,)h{0,y2)l{B{0,yi)nB{0,z + y2)y^(li} 

^ / -Vol(B{0,yi)UB{^,^+y2))/(x) _ g-<^d(i?''(0,yi)+/?''(0,y2))/(x)\ d^dzdyidy 

To conclude, we have proven that 



2- 



k 



(8.3) hm n-^Var (iVfc,„) = 7fc(A) + Vt^ (A) := ^^(A) G (0,oo), 

as required. D 
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8.3. CLT - Poisson Case. Next, we prove the CLT result in Theorem 
13.71 again using Stein's method, as in the proof of Theorem [ 



Proof of Theorem 13.71 {Nk^n only) . We start again by counting only 
critical points located in a compact set A C W^, with f^ f{x)dx > 0. We 

define Qi^n,Nf^^^,N^^,gr„,{lA,''^) and ^j the same way as in the proof of 
Theorem 13.31 Then, as in the proof of Theorem 13.61 one can show that 

(8.4) lim n^^Var (iV^„) G (0,oo). 



According to Theorem IB.31 in order to prove a CLT for N^^, we need to 
find bounds for E {|^j|^} , p = 3,4 . We start with p = 3. 

E I (N^i - E {se.})n = ± {') (-i)i (e {a^« })'-''e { («i:r ^' 



i=o 



The computation of the bound here is similar in spirit to the ones we used 
in the proof of Theorem 13.21 but technically more complicated, and we shall 
not give details. Rather, we shall suffice with a brief description of the main 
ideas: Every element in the sum can be expressed as the expectation of a 
triple sum of the form 

(8-5) 4 E E E £^(3^i>^i'^)5^2(3^2,p(^))<7«(3^3,7'(=^))|, 
^ yicvL'^ y2cvi?^ yscvL'^ ^ 

where each of the Poisson processes can either be equal to one of the others 
or an independent copy, depending on j. As for E{A2} in the proof of 
Theorem 13.61 we can collect all the terms in which at least one of the balls 
B[yi) is disjoint from the others, and show that they cancel each other. For 
each of the remaining terms, we can show that if |3^i U 3^2 U 3^3| = 3k + 3 — j, 
with < J < 3A; + 3, then the relevant part of the sum in (j8.5p is bounded 
by either c*n^^"'"'^~-'r„ "' or c*'n?'^'^^~^ Sn = c*n. The first bound 

is achieved by requiring that all the points are within distance of r„ from 
the cube Qi^n, and the second bound is achieved using integral evaluations 
similar to the ones used in the proof of Theorem 13.61 

Recall, that \Ia\ < c*r~'^. In the critical range, choose the first bound 
together with the fact that nr'^ is bounded to obtain 

|^^|3| < 'Hn _ = £ ^ ^ 0. 

^^U (Var(7V,^„)) nV^ (n-^Var (iV,^„ 
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In the supercritical range, choose the second bound to obtain 

ieiA (Var (iV^^Jj nr'^ [n-^Yav (^N^„^ 

The proof for p = 4 is similar. Thus, from Theorem IB. 31 we have that 

To conclude the proof, we need to show that the CLT for N^^ implies a 
CLT for Nk^n- This is done exactly as for Part 3 of Theorem 13.31 D 

8.4. CLT - Random Sample Case. To complete the proof of Theorems 
I3.6l and l3.7t we need to show that the same limit results apply to the random 
sample case as well. While we again rely on De-Poissonization, it is worth 
noting that, as opposed to the subcritical range, here the limiting variances 
are different in the Poisson and random sample cases. We start by defining 

[k + ly. Jmd)k+2 



[k + iy. J{Rd)k+2 

where A < oo, x G M'^ and y G (M'^)'=, z G R'^. 

Proof of Theorems 13.61 and 13.71 (Nk^n)- Let Dm,n denote the incre- 
ment: 

Dm,n = 2^ gr„{y,^m+l) - /.^ 9r„iy,^m)- 

Let 7 be an arbitrary number in (1/2, 1). We wish to apply Theorem ICH 
with HniVn) = ]^k,n a^d a = afc(A) := {k + l)7fc(A) — ??fe(A). Thus, we need 
to prove: 

(8.6) lim sup \E{Dm,n}-ak{X)\=0 

(8.7) lim sup \E{D,n,nDn,>,n}-ak{Xf\=0 

"^°° n~n^ <m<m' Kn+n-y 

(8.8) lim sup n-i/2E{D^„} =0 

^^°° n-n"' <m<n+n-y 
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As in the proof of Theorem 13. 3^ write Dm^n = D^ „ — D^^, where 

D^,n = #{3^ C Xm+i ■■ 9r„{y,Xm+i) = 1 and X„+i G y} , 
D^n^n = *{y^Xra: 5r J3^, ^m) = 1 and X„+i G 5(3^)} . 

From the definition of D+ „ we have that 



^{Dt^,n] = {^^^{9rSy^X^+l)} ■ 



Thus, 

i7 



As in the proof of Lemma |8.2| since 7 G (1/2, 1) it is easy to show that 
hm n''^{gr.Sy,Xn±n'<)] = (A: + l)!7fc(A) 

n— ^-oo 

and since \min~^^n~^i^ k ) ~ -'-/^■' ^^ have 

hm ( sup |E{D+J-(A; + l)7fc(A)|') =0. 

Next, from the definition of D~ „ we have, 

(777 \ 
^^^J¥.{grSy,X^)-^B[y){Xm+i)]. 

Note that if X is a random variable in M with density /, independent of 
Af„, then we can replace X^+i with X in the last equality. Thus, we have 

(jl — 77, '^ \ 
^^ jE{<7.„(3^,^„+„.)lB(y)W}, 

(77 — )- 77,'' \ 
^^ jE{<7.J^fc+l,A'„_„.)lB(;,^^^)(X)}. 

In addition, it is easy to show that 

hm n''+^^{grSy.Xn±n^)lB(y){X)] = (fc + l)!r?fc(A). 

Thus, 

lim ( sup |E{D^^„}-7?fc(A)|') =0. 

"■~*'°° \n-n~i <m<n+n~i J 
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Finally, since \Dm,n - afc(A)| < \D+^„ - {k + l)7fc(A)| + |Dm,„ - %(A)|, we 
conclude that (|8.6p holds. 

To prove (18. 7p we need to show that, 



lim sup \E{Dm,nDm',n} -o:k{>'f\] =0- 

jn-oo \n-n'y <m<in' <n+n'' I 

Recall that Dm,n = D^^—D^ „, and so we can write the product Dm,nD„ 

as a sum of four different products. We start with £)+ n^m' n- 

(8.9) 

Now, if |3^ n 3^'| = J > 0, then it is easy to show that 

lim n'^-^W.{grSy,Xrn^-i)grAy',X^'+i)]=^- 

rn-oo 

Therefore, the relevant part of the sum in (j8.9p satisfies 
m\ (m! — A; — l\ /A; + 1^ 






and 
?n,\ /ttt,' — A: — 1\ /A; + 1 



/n + n'>'\ /n + n'> - A; - 1\ /A; + 1\ . , , , , , , 

<y ^ ^y ^_. j(^ . JE{<7.„(3^,A'„+i)5.jy,A',n'+i)}^o 

If 3; n 3^' = 0, then 

]E{5r„(3^,^™+i)5r„(y,;t:m'+i)} > E{5,„(J,A'„+„.)5,„(y,Af„+„.)} , 

IE{<7rj3^,^m+l)5r„(y,^,n'+l)} < E {(7,„ (37, A'„_„.)5,„ (y , A'™.^.)} , 

and it is easy to show that 

lim r?''^{grSy.Xn±ni)9rSy'.?in±ni)] = {{k + ly.JkWf ■ 
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Therefore, the relevant part of the sum in (j8.9p satisfies 

fn — n'\ fn — n"' — k — 1 
-[ k )[ k 

and 
m\ fm! — k — 1 
k)\ k 

fn + n'^\ fn + n"' — k — 1 
-[ k )[ k 

Thus, 



E{grAy,Xn+n-r)9rAy\'^n+n-r)} ^ {{k + ly.^kWf , 



E{gr,Sy,Xm+l)grAy,'^m'+l)} 

¥.{grAy,Xn-n^)grAy',^n~n^)} ^ {{k + l)hk{X)f . 



hm sup 

n-i-oo yn-nf <m<m' <n+n'y 

Similarly, we can show that 



E 



{D^,nDi>,r.}-iik + ^)hk(.X)f 



0. 



lim sup 

"'^'°° \n-n"'<m<m'<n+n'y 



lim sup 

"■~^°° \ n-nT<m.<m'<n+nT 



lim sup 

"~^°° \n-nT<m.<m'<n+nT 






E ^D^ 



0. 



Combining all these limits together shows that ()8.7p holds. Finally, similar 
computation yields (|8.8p . 

For the last condition in Theorem IC-H note that 

Hni^m) < L'^^ < c*m^^^ < c*{n + m)''+\ 

Thus, taking /3 = max(c*, k + 1) completes the De-Poissonization proof, and 
from Theorem lC.il we conclude that a^(A) < ?^(A), and that 



(8.10) 
and that 



i2r\\ „2/\\ . 2r 



lim Var (iVfc,„) = a^(A) - a^(A) := a^(A). 

n— >oo 



n 



which completes the proof of Theorem 13. 7t as promised. 



D 
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The only remaining results in Section [3] that still require proofs relate to 
the global number of critical points. 

Proof of Theorem I3.8[ This theorem is proved exactly the same way 
as Theorems l3.5l[3T6l and 13. 71 are proved in the super-critical phase. The only 
difference is that, throughout, /i(x) replaces /it-„(x). This, however does not 
affect any of the results, since in the limit /i7-„(x) — )• /i(x). D 

Proof of Proposition 13.91 We prove the proposition for the Poisson 
case. The random sample case is similar. 

(IRd)fc+l 

^ ^ f{x)f{x + Sny){h{0,y)-KAO,y))ne-^P^^'^+'"yUydx. 



)k + l 



Since ne~^P — )• 0, applying the dominated convergence theorem yields the 
desired result. All we need to do is to show that the integrand is indeed 
bounded by an integrable expression. As in the proof of Theorem 13.51 (cf. 
()8.2p ). we can show that the integrand is bounded by 

(8.11) /(x)/L,ne-/-^*«(0'y)'. 

Now note that if the integrand is nonzero then h ^ hr„ , and so R{0, y) > Tn- 
Therefore, R{0,yY > l/2{R{0,y)'^ + nr^), and (fSllT) can be replaced by 

Assuming that nr^ > D* log n, with D* = (/minC*/2)~^ then ne"-^™'"^*"'''"/^ < 
1 and we obtain an integrable bound. D 

9. Euler Characteristic Results. Finally, we prove Corollary 14.21 

Proof of Corollary 14. 2[ First note that A'o = n in the random sam- 
ple case, and E < Nk,n ^ = n in the Poisson case. Thus 

d d 

E{Xn}=n + ^(-l)'=E{Arfc,4, E{xn} = n + ^(-1)'=E {iV,,,,} . 

k=l fc=l 
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The first two cases of the theorem are now obvious consequences of Theorems 
13.11 and I3.5i For the third case, using Theorem 13.81 we have 



hm n-\n= lim n-^ V (-1)'=7V^^^. 



However, since N^^ counts all the critical points in W^, Morse theory implies 



k=0 

and we can conclude that lim„^oo n^^Xn = 0. 

If, in addition, r!f^ satisfies the conditions of Proposition 13.91 (i.e. nr!f^ > 
D* logn), then 

d 

= lim y^{-lfW.{N^^l - NkA = 1 - lim Xn, 

fc=0 

which implies that Xn — ^ 1- D 

APPENDIX A: PALM THEORY FOR POISSON PROCESSES 

This appendix contains a collection of definitions and theorems which 
are used in the proofs of this paper. Most of the results are cited from 



17l |. although they may not necessarily have originated there. However, for 



notational reasons we refer the reader to 17|, while other resources include 
[3|, ll9[] . The following theorem is very useful when computing expectations 
related to Poisson processes. 



Theorem A.l (Palm theory for Poisson processes, [l7| Theorem 1.6). 
Let f be a probability density on W^ , and let Vn be a Poisson process on W^ 
with intensity A„ = nf . Let h{y,X) be a measurable function defined for all 
finite subsets y <Z X CW^ with \y\ = k. Then 

k 

ycVn 

where y' is a set of k iid points in M with density f, independent ofVn- 
We shall also need the following corollary, which treats second moments: 
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Corollary A. 2. With the notation above, assuming |3^i| = |3^2| = k, 

2k— i 

ie{ Y1 Kyi,rn)h{y2,Vn)} = -^7l^—ryr^^{h{y[,y[2UVn)h{y'2,y[2UVn)} 

\yiny2\=j 

where y[2 = 3^i U 3^2 ^-^ ^ ^^^ ^f 2^ — j Hd points in W^ with density f{x), 
independent ofVn, and |3^{ n 3^2! — J- 

Proof. Given \Vn\ = rn, the sum on the LHS is finite. Therefore, 
(A.l) e{ Y. Kyi,T^n)h{y2,Vn)\Vn\ = m^ 

yi,y2CVn 
\yiny2\=j 



2r-j)Ck ')Q)E{M3^1,^n)M3^2,Pn)| \Vn\=m}^y^ 



ny2\=j 



Choosing now all possible subsets y of size 2k— j, and splitting each of them 
into two arbitrary subsets 3^i,3^2 of size k with |3^i n 3^2! = j, yields 

(A.2) e{ Y1 Hyi,'Pn)h{y2,Vn)\rn\ = m} 



ycVn 
\y\=2k~j 



\{h{yi,Vn)h{y2,Vn) I \Vn\=m]^y^, 



m 

\ m' J A/ M. 'I? \h>i a;_ -i? \ I \-iy 1 — ^^ i 



.2fe - j, 
Combining (jA.ip . ()A.2p . and Theorem lA.il for subsets y of size 2k — j yields, 



E 



{ Y. Kyu'Pn)h{y2,Vn)} 



yi,y2cv„ 
\y1ny2M 



^\ ^')(^)ie{ Y1 HyuVn)h{y2,Vn)} 

\y\=2k-j 

^{h{y[,y[2^Vn)h{y'2,y[2^Vn)] 



1^2 



jm-m 

where 3^(2 = 3^i U 3^2 is a set of 2k — j iid points in W^ with density f{x), 
independent of Vn, and |3^i n 3^2! = J- ^ 
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APPENDIX B: STEIN'S METHOD 

In this paper we heavily used Stein's method to derive hmit theorems for 
the sums of dependent Bernouhi variables. We need both the Poisson and 
normal approximations, which are presented below. 

Definition B.l. Let {I,E) be a graph. For i,j G / we denote i ~ j if 
{i,j) G E. Let {^i}jg/ be a set of random variables. We say that (/, ~) is a 
dependency graph for {^j} if for every Iif] I2 = 0, with no edges between 
/i and I2, the set of variables {^i}ie/i ^® independent of {Ci}i^j^- We also 
define the neighborhood of i as Mi = {i} L) {j £ I : J ~ «}. 

Theorem B.2 (Stein's Method for Bernouhi Variables, Theorem 2.1 in 
[l7!]). Let {CiJie/ ^^ ^ ^^^ of Bernoulli random variables, with dependency 
graph (/, ~). Let 

p^■.= E{C^}, p^J■.= E{C,CJ}, A:=J]p„ 1^:=^^^, Z ~ Poisson (A) . 
Then, 



m 



dTv(VF,Z)<min(3,A-i)(J^ J^ p,j + Y,Y.P^Pj 

Theorem B.3 (CLT for sums of weakly dependent variables. Theorem 2.4 
ITJl). Let {S,i)iei be a finite collection of random variables, withE,{S,i} = 



0. Let (/, ~) be the dependency graph of {^i)i<^i, and assume that its maximal 
degree is D — 1. Set W := J2i£i ■?«; ^'^^ suppose that E {l^^} = 1. Then for 

all w £R, 



where Fw is the distribution function of W and $ that of a standard Gaus- 
sian. 

APPENDIX C: DE-POISSONIZATION 

Recall that the results in this paper apply to both fixed size sets Xn and 
for Poisson processes Vn- In some cases it is easier to prove the results for 
Vn first, and then conclude that similar results apply to X^. The second step 
is known as 'De-Poissonization', and our use of it will depend primarily on 
the following theorem. 
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Theorem C.l (De-Poissonization, Theorem 2.12 in [IS]). For every n G 
N, let Hn{X) be a functional defined for all finite sets of points X C W^ . Let 
Vn CL Poisson process defined the same way as in Section\^ such that 

n Wax [Hn\Pn)) ^ cr and ;= >J\{\),a), 



as n —7- oo. Define 



Rm,n '■— Hn{Xm+l) — Hn[X„ 



where Xm is defined as in Section [3 In other words, Rm,n measures the 
change in the value of the functional Hn as a single point is added to the 
random set. Supposed that there exist a G M and 7 > 1/2 such that, 

(C.l) lim ( sup \¥.{Rra,n]-a\]=Q 

"^°° \n-n'y <m<n+n'y J 

(C.2) lim sup \& {Rm,nRm' ,n] - 

"^°° \n-ni <m<m' <n+n-' 



(C.3) 



lim ( sup n-i/2E{ii^„} ) =0, 

rn-oo y^-n-y <m,<n+n~i ' J 



and also that there exists /3 > such that \Hn{Xm)\ < /3{n + m)" (a.s.). 
Then c? < a"^, and as n ^- c«, 

-Ur (u IVW . 2 2 , Hn{Xn) -^{Hn{Xn)] D .r, 2 2\ 

n War [Hn[Xn)) ^ a —a and ■= >J\l{0,a —a). 



'n 
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